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(57) ABSTRACT 

This invention relates to the automated assistance of a 
telephony call center agent comprising a method and system 
for providing dialogue suggestions to an agent during an 
agent caller interaction. A prior art solution provides data 
based on analyzed text from one or other of the conversa- 
tions but does not offer any useful information based on the 
performance of the agent or on the state of the interaction. 
A method of interaction in a voice response application 
comprising: receiving a voice signal in a call center; iden- 
tifying whether the caller or a call center agent is the 
originator of the voice signal; converting the voice signal 
into computer readable text; identifying a key word such as 
a confrontational phrase e.g. 'what are you talking about' in 
the converted computer readable text; and providing a 
different suggestion depending on whether the originator is 
the caU agent or the caller. For instance, a suggestion if the 
agent made the confrontational phrase would be to use a less 
confrontational phrase next time such as 'can you explain 
that again'. A suggestion if the caller made the confronta- 
tional phrase would be to counter with a Til try to explain 
that better*. 

18 Claims, 4 Drawing Sheets 
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CALL CENTRE AGENT AUTOMATED 
ASSISTANCE 

FIELD OF INVENTION 5 

This invention relates to the automated assistance of a 
telephony call centre agent. In particular it relates to a 
method and system for providing dialogue suggestions to an 
agent during an agent caller interaction. 

BACKGROUND OF INVENTION 

A call centre can be defined as a place in a company or 
business that handles incoming and/or outgoing calls from/ 
to its customers in support of its day-to-day operation. This 
can be a telemarketing area, where the employees make 
outgoing calls to try and sell the company's products. It can 
be a service area that receives incoming calls from its 
customers for repair or maintenance of the company's goods 
or services, A call centre will have a telephone system which 
may be as simple as a small single-line phone, increasing in 
complexity up to a large multi-node PBX. A call centre 
would normally have a computerized system for tracking, 
logging and recording call details, although some simply use 
paper forms. It may have one operator or agent, or it may ^5 
have many, depending on the size of the company or 
business. 

A call centre company needs to ensure quality control on 
their agents performance in an interaction with a caller in 
order to maintain customer satisfaction and to keep an 3Q 
acceptable call rate through each agent. One way to educate 
agents is to record their interaction with a caller and have a 
supervisor go through the recording and discuss possible 
problems and area of improvement. However this education 
is performed off line and is expensive in human resource 35 
terms. Much cheaper is an automatic prompt system offering 
hints and suggestions to the caller. 

British patent application GB2333 173 discloses an intel- 
ligent human computer interface providing real time context 
data in relation to automatically recognized input signals. 40 
The system monitors in real time the operator and caller 
speech input to extract a subject matter. Once the subject has 
been identified an action is determined to provide useful data 
to the agent such as a database link. This may involve 
querying one or more databases with the extracted subject 45 
and providing visual data based on the result. This prior art 
solution provides data based on analyzed text from one or 
other of the conversations and does not offer any useful 
information based on the performance of the agent or on the 
state of the interaction. 50 

SUMMARY OF INVENTION 

In one aspect of the present invention there is provided an 
interactive voice response application as claimed in claim 1 
and a method of interactive voice recognition as in claim 10. 55 
Therefore the invention provides a method of making sug- 
gestions and comments on the interaction between the agent 
and the caller that has only been previously possible with a 
human supervisor. The invention provides a suggestion to 
the agent that can be used to counter a particular (e.g. 60 
confrontational) key word spoken by a caller. The agent can 
be provided with one or more possible alternatives so that 
they can chose the most appropriate. Furthermore the inven- 
tion provides a comment to the agent to his own dialogue so 
that if the agent is being is using inappropriate language (e.g. 65 
confrontational) a polite alternative or alternatives can be 
suggested for next time. 
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Advantageously the means for providing a key word 
suggestion comprises: means for providing an alternative 
key word if the originator is the call centre agent; and means 
for providing a counter key word or phrase if the originator 
is the caller. Preferably each key word is stored in a field of 
a record in a key word table, a further field stores an 
alternative word and a further field stores the counter word 
so that once the keyword is known the alternative or counter 
words may be acquired by lookup. These provides a simple 
and quick method of providing supervision to an interaction. 

In the preferred embodiment the IVR application further 
comprises means for placing comments and suggestions on 
the caller workstation as the call is being placed so that the 
agent can have suggestions in real time as the call is going 
through and helpful suggestions can be chosen from to aid 
the interaction. 

In the preferred embodiment the IVR application further 
comprises means for storing the dialogue text and the 
comment and suggestion text together for later retrieval so 
that an agent can review past performance, see the text 
version of the dialogue and compare the suggestions and 
comments. 

In the preferred embodiment the IVR application further 
comprises means for grading each computer readable text 
with respect to the key word text so that a total grade for the 
interaction may be found. A further field in the key word 
table stores most advantageously stores a value for the key 
word being an subjective grading of the strength of inap- 
propriateness or in the present embodiment a subjective 
grading of the strength of confrontation of the keyword. In 
this embodiment each key word in a dialogue line takes 
away the value of that keyword from the total grade. The 
IVR application further comprises means for storing the 
individual grade and the overall grade together with the 
dialogue text. 

A key word means a single word or groups of words that 
form a phrase. 

In this description the word 'caller' represents not only a 
party making an incoming telephone call but also a party that 
is the recipient of a telephone call. 

BRIEF DESCRIPTION OF DRAWINGS 

In order to promote a fuller understanding of this and 
other aspects of the present invention, an embodiment will 
now be described, by way of example only, with reference 
to the accompanying drawings in which: 

FIG. 1 is a schematic representation of a call centre of the 
embodiment; 

FIG. 2 is an example of a word table of the embodiment; 
FIG. 3 is an example of a dialogue store of the embodi- 
ment; and 

FIG. 4 is a process flow of the embodiment. 

DETAILED DESCRIPTION OF PREFERRED 
EMBODIMENT 

Referring to FIG. 1 a call centre 10 comprises: a PC based 
computer telephony platform 12; a number of PC based 
computer clients or agent workstations 14 connected to the 
telephony plalfonm 12; a local area network (LAN) 16 
connecting the workstations 14 and the telephony platform 
12; a telephony switch (PBX) 20; a control line 21 connect- 
ing the telephony platform 12 with the switch 20; telephone 
lines 23 connecting the telephony platform 12 with switch 
20; agent telephones 22 conesponding to each of the work- 
stations 14 connected to the switch 20. Additional telephone 
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lines connect switch 20 to public telephony network 18 and <:a=dialpgue=manager=(DM)=seiver=(not=sbown);==a deve lop- 
switch 20 to agent phones 22. ment work area (not shown); an application manager (not 

The switch 20 makes, breaks or changes the connections shown); a node manager (not shown); a general application 

between telephone lines in order to establish, terminate, or programming interface (API) 60; voice application 62; word 
change a telephone call path; it is typically a private branch 5 table 64; and dialogue store 66. API 60 is a conduit for all 

switch residing on the same premises as the telephony communications between the component parts of the voice 

platform 12. The switch 20 would suitably be a Siemens processing software 44. A server is a program that provides 

Hicom* 300 but could be one of many suitable switches services to the voice response application 62 or any other 

provided amongst others by Lucent, Nortel or Alcatel. The client. The modular structure of the voice processing soft- 
switch 20 provides network information to the telephony ^0 ware 44 and the open architecture of the general server 

application such as Aisn (answer number identification, also interface API 60 allows development of servers that are 

known as Caller Line Identification (CLQ) and DNl (dialled unique to specific applications. A user-defined server can 

number identification). It also allows telephony platform 12 provide a bridge between the voice processing software and 

to perform intelligent dialing functions and to transfer calls. another product. 

Each workstation 14 is typically a Pentium microproces- Telephony server 52 connects to the network interface 46 

sor based PC with 32M bytes of memory, 4 Gbytes of hard and provides telephony functionality to the voice response 

drive, keyboard, mouse and VDU connected to the LAN 16 ap^ication^_ — 

using an Ethernet card, A suitable operating system is jThe automatic speech recognition (ASR) server 54 -is 

Microsoft NT for workstations running the workstation large-vocabulary, speaker-independent bontinuous function 

application. The workstation application sends and receives based-on_IBM-Via-Voice-and^iising.DSB^48"to"performltf^ 

messages from the switch through the LAN 16 and tele- preliminary frequency analysis on the voice signal. The 

phony application using an application programming inter- voice signal is converted into frequency coefficients by the 

face which is part of the telephony application. DSP 48 which are passed on to the ASR server 54 to perform 

Telephony platform 12 comprises: a personal computer Markov analysis and phoneme matching to acquire 

with an Industry Standard Architecture (ISA) bus or a machine -readable text. 

Peripheral Component Intercoimect (PCI) bus 40, running The development work area allows the creation and 
Microsoft Windows NT; call processing software 42; voice modification of a voice-processing application. The appli- 
processing software 44; one or more Dialogic or Aculab cation manager executes the voice response application. The 
network interface cards 46 for connecting the required type node manager allows monitoring of the status of application 
and number of external telephone lines 23; one or more sessions and telephone lines and allows the issue of corn- 
Dialogic voice processing cards 48; a System Computing mands to start and stop application sessions. 
Bus (SCbus) 50; and LAN network interface 51. SCbus 50 Voice application 62 controls the interaction between the 
is a dedicated voice data bus which connects the network voice processing software 44 and a caller. Applications are 
card 46 and the DSP card 48 so that data flow congestion on written in Telephony Java, which incorporates the power and 
the PCI system bus is avoided and voice processing speed is ease-of-use of the Java programming language. The voice 
increased. Telephony platform 12 supports up to 60 El or 48 processing system can run up to sixty applications simulta- 
Tl telephone lines 23 connected through telephony network neously ranging from one voice response application run- 
interface 46. If call volumes require more than 60 El or 48 ning on all sixty lines to sixty different voice applications 62 
Tl lines, additional voice processing systems can be con- each running on a separate line. 

nected together through the LAN 16. y^e word table 64 comprises a plurality of records having 
Call processmg software is suitably based on IBM Call- four fields, this is shown in more detail in FIG. 2. Each 
Path software for controlling the interactions between the record represents a confrontational statement, phrase or 
agent workstations and agent telephones. word. The first field contains the text of the confrontational 
The voice processing software 44 comprises IBM*s Voice 45 statement or phrase. The second field contains the text of a 
Response for Windows (previously known as IBM non-confrontational alternative for the confrontational state- 
DirectTalk/2) is a powerful, flexible, yet cost-effective ment or phrase, Tlie third field contains the text of a 
voice-processing software for the Windows NT operating non-confrontational counter phrase to the original confron- 
system environment. Although the embodiment is described tational phrase. The fourth field contains an absolute numeri- 
forWindows, an equivalent platform is also available for the 50 cal value representing the strength of the confrontational 
UNIX environment from the IBM Corporation, in which phrase. For instance, confrontational phrase 'what are you 
case a maximum of 12 digital trunks per system (360 El or talking about?' has a confrontation value 2. The confronta- 
288 Tl channels) may be supported. Used in conjunction tion value is subtracted from confrontational score for the 
with voice processing hardware. Voice Response can con- interaction. A non-confrontational alternative is *rm not 
nect to a Public Telephone Network directly or via a PBX. 55 sure 1 understand you?* which the system might suggest if it 
It is designed to meet the need for a fully automated, detects that the agent has made the confrontational state- 
versatile, computer telephony system. Voice Response for ment. If the agent uses the non-confrontational statement 
Windows NT not only helps develop voice applications, but then the confrontational value is added to the confrontational 
also provides a wealth of facilities to help run and manage score for the interaction. A counter statement to the con- 
them. Voice Response can be expanded into a networked go frontational statement might be "ITl try to explain that 
system with centralized system management, and it also better", which the system suggests if it detects that the caller 
provides an open architecture, allowing customization and has made a confrontational statement. If the agent uses the 
expansion of the system at both the application and the counter statement then the confrontational value is added to 
system level. the confrontational score for the interaction to give an 

The voice processing software 44 comprises: a telephony 65 improved score. 

?xfZ?L?^» jti auto matic speecbMT^cogDition (ASR) server 54; The dialogue store 66 comprises a plurality of records for 

a natural langu ageunderstanding_(NLU) server (hot shown) ; storing the interaction and the interaction score and is 
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illustrated with an example in FIG. 3. Each record contains j Step 209. The voice application checks whether it is tfie 

three fields. The first field contains the number of the agent or cdler who has taken the confrontation value, over 

dialogue line so that the dialogue may be viewed in the order the threshold. If it is"th"e"agent then the voice application 

in which it happened. The second field contains an indicator selects the polite alternative word or phrase in the records 
of whether it is the caller or the agent's dialogue. The third 5 and flashes it on the screen to the agent as a^waraing. 

field contains the text of the dialogue. The fourth field Furthermore the selection is stored in the dialogue store^66— 

contains a voice appHcaUon suggestion or comment. The -^IS^S ^^R text and dialogue-reference numberrif / 

fifth field contains a cumulative interaction score. caller has spoken the confrontational word or phrase then> 

T-u *u J f »u u J- * J f 11 -^u Uhe-_voice application selects the counter confrontational 

The method of the embodiment proceeds as follows with / rrr^, u a a u *i. * »u ^ * 

f FIG 4 10 /w°^'L^I_PH???i359J^°^s-on~'the-scre 

reierence o Furthermore tfie'seiection is stored in the dialogue store 66 

A caU starts with either an agent making a outbound call along with the ASR text and dialogue reference number, 

or a caller making an inbound call (step 201), Either way the summary this invention relates to the automated assis- 

PBX 20 sends a message over control line 21 to the network tance of a telephony caU centre agent comprising a method 

interface 46 which passes the message to the call processing and system for providing dialogue suggestions to an agent 
software 42 and the voice processing software 44, If the call 15 ^^^ing an agent caller interaction. A prior art solution 

is inbound the call processing software 42 directs the call to provides data based on analyzed text from one or other of the 

an appropriate agent phone 22 by signaling the PBX 20 to conversations but does not offer any useful information 

initiate the business software on the agent's workstation based on the performance of the agent or on the state of the 

with caller details. For both.inboun d and outb ou nd calj sjhe interaction. A method of interaction in a voice response 
voice^piocessmg softwarei44 instantiates.a^^oice^p 20 application comprising: receiving a voice signal in a call 

r62'to monitor the voice jnteraction. centre; identifying whether the caller or a call centre agent 

The'agent voicesignal and caller voice signal are copied is the originator of the voice signal; converting the voice 

at the PBX 20 and transmitted over the telephone lines 23 to signal into computer readable text; identifying a key word 

the telephony platform (step 202). This is transparent to the such as a confrontational phrase e.g. *what are you talking 

caller and the agent as the original voice signals continue about' in the converted computer readable text; and provid- 

through the PBX 20 to the respective recipients. The voice ing a different suggestion depending on whether the origi- 

application associates the voice signal with a reference nator is the call agent or the caller. For instance, a suggestion 

number incremented from the last reference number. The if the agent made the confrontational phrase would be to use 

PBX supplies an indicator of the originator of the voice a less confrontational phrase next time such as *can you 

signal and this is associated with the voice signal reference explain that again'. A suggestion if the caller made the 

number. confrontational phrase would be to counter with a U'll try to 

The^voice signal firom"either party is transmitted through explain that better'. 

the^network.intetf^ej46^to the^;S€B^^ "Windows NT" is a trademark of Microsoft Corporation 

^DSP48rFrequency coeflScients-are-passed-to ASR-server 54 and "Hicom" is a trademark of Siemens, 

and^nextzstring^is passed on to voice application 62 (step Although we have used confrontational and non- 

2p3) and assodated >yith tbe.rcferencenumber of - the^voice confrontational key words in this embodiment there is no 

signal.^^ reason why other oppositions could be used such as polite 

The voice"applicati6n"62 searches the text foFocctirrences and not polite or politically correct or non-politically cor- 

of words or phrases in the^word-table"64-and acquires^core rect. Acall centre supervisor could use different word stores 

for any hits (step 204)f'The confrontation value associated depending on what aspect of training was desired. Altema- 

with-each"record-is~an absolute value. The value is made tively different word stores could be used simultaneously to 

negative if the hit is in the confrontational list and positive give an all round training to he want, 

is in the non-confrontational alternative word list or the Although the embodiment has been described with 

non-confrontation counter word list. The value is the same respect to a key phrase it is envisaged that the key feature 

polarity for both the caller and the agent since the value is could be a particular tone of voice to denote an aggressive 

a measure of the whole interaction not just one side of the style. The ASR could be programmed with a dictionary of 

interaction. words which could distinguish between aggressively spoken 

The converted text is saved in the dialogue field in the words and quietly spoken words so that the key feature of a 

dialogue store 66. phrase or voice signal could be an aggression characteristic. 

The voice application 62 updates a variable associated Although the interaction is described in the embodiment 

with the interaction (step 205) and the variable is saved in for a call centre it is not so limited and could easily be 

the score field of the dialogue store for that dialogue. At the applied to a standalone training system for other forms of 

start of the interaction the interaction value is reset to zero. agents such as receptionists, secretaries. It is envisaged that 

If the call ends the PBX 20 will send a message via the 55 it would be easy to adapted the methods outlined here for 

network interface 46 to the voice application and the end call sales training techniques, therapy methods, role play and 

routine is entered (step 206). management techniques. 

That particular interaction has an identification number Now that the invention has been described by way of a 

and the dialogue store 66 is associated with the interaction preferred embodiment, various modifications and improve- 
number so that a complete history of the interaction along 60 ments will occur to those person skilled in the art. Therefore 

with a cumulative interaction score can be retrieved for later it should be understood that the preferred embodiment has 

examination (step 207). been provided as an example and not as a limitation. 

The interaction score is checked (step 208) against a What is claimed is; 

threshold value and if so moves on to the agent action step 1- A sj^tem comprising: 
(209). If below the threshold value the interaction is deemed 65 means for receiving a voice signal; 

not confrontational enough to warrant any action and the first identifying means for identifying whether a caller or 

process starts over with a new voice signal input at step 202. a call center agent is the originator of the voice signal; 
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second identifying means for identifying a key phrase, in 
the voice signal; and 

means for selecting one of a plurality of suggested 
responses depending on the first and second identifying 
means, wherein said means for selecting one of a 5 
plurality of suggested responses comprises first means 
for selecting a suggested response as an alternative to 
the response given if the originator is the call center 
agent and second means for selecting a suggested 
response as counter to the response given if the origi- 
nator is the caller; and 

providing means for providing the selected suggested 
response to the agent so that the agent may use the 
suggested response immediately or at a future time. 

2. A system as in claim 1 further comprising means for 
converting the voice signal into computer readable text. 

3. A system as in claim 2 wherein the providing means 
stores the computer readable text and suggested response 
together for later retrieval. 

4. A system as in claim 1 wherein each key phrase is 
stored in a field of a record in a key phrase table, a further 
field stores an alternative phrase and a further field stores a 
counter phrase. 

5. A system as in claim 1 wherein said providing means 
places comments and suggestions on a caller workstation as 
a call comprising said voice signal is being placed. 

6. A system in claim 1 further comprising means for 
grading each voice signal interaction with respect to the key 
phrase and acquiring a total grade for the interaction. 

7. A method comprising: 
receiving a voice signal; 

identifying whether a caller or a call center agent is the 
originator of the voice signal; 

identifying a key phrase in the voice signal; and 35 

selecting one of a plurality of suggested responses 
depending on the first and second identifying means, 
wherein a first suggested response is selected as an 
alternative to a response given if the originator is the 
call center agent and a second suggested response is 
selected as a counter to a response given if the origi- 
nator is the caller; and 

providing the selected suggested response to the agent so 
that the agent may use the suggested response imme- 
diately or at a future time. 

8. A method as in claim 7 further comprising a step of 
converting the voice signal into computer readable text. 

9. A method as in claim 8 wherein each key phrase is 
stored in a field of a record in a key phrase table, a further 
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field stores an alternative phrase and a further field stores a 
counter phrase. 

10. A method as in claim 8 further comprising storing the 
computer readable text and suggested response together for 
later retrieval. 

11. A method as in claim 7 further comprising placing 
comments and suggestions on a caller workstation as a call 
comprising said voice signal is being placed. 

12. A system method as in claim 7 further comprising 
grading each voice signal interaction with respect to the key 
phrase and acquiring a total grade for the interaction, 

13. A computer program product, stored on a computer- 
readable storage medium, for executing computer program 
instructions to cany out the steps of: 

receiving a voice signal; 

identifying whether a caller or a call center agent is the 
originator of the voice signal; 

identifying a key phrase in the voice signal; and 

selecting one of a pluraUty of suggested responses 
depending on the first and second identifying means, 
wherein a first suggested response is selected as an 
alternative to a response given if the originator is the 
call center agent and a second suggested response is 
selected as a counter to a response given if the origi- 
nator is the caller; and 

providing the selected suggested response to the agent so 
that the agent may use the suggested response imme- 
diately or at a future time. 

14. A computer program product as in claim 13 further 
comprising converting the voice signal into computer read- 
able text. 

15. A computer program product as claimed in claim 14 
wherein each key phrase is stored in a field of a record In a 
key phrase table, a further field stores an alterative phrase 
and a further field stores a counter phrase. 

16. A computer program product as claimed in claim 14 
further comprising storing the computer readable text and 
suggested response together for later retrieval. 

17. A computer program product as claimed in claim 13 
further comprising placing comments and suggestions on a 
caller workstation as a call comprising said voice signal is 
being placed. 

18. A computer program product as claimed in claim 13 
further comprising grading each voice signal interaction 
with respect to the key phrase and acquiring a total grade for 
the interaction. 
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